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Abstract. Three-point semidefinite programming bounds are one of the most 
powerful known tools for bounding the size of spherical codes. In this paper, we 
use them to prove lower bounds for the potential energy of particles interacting 
via a pair potential function. We show that our bounds are sharp for seven 
points in RP^. Specifically, wc prove that the seven lines connecting opposite 
vertices of a cube and of its dual octahedron are universally optimal. (In other 
words, among all configurations of seven lines through the origin, this one 
minimizes energy for all potential functions that are completely monotonic 
functions of squared chordal distance.) This configuration is the only known 
universal optimum that is not distance regular, and the last remaining universal 
optimum in RP-^. We also give a new derivation of semidefinite programming 
bounds and present several surprising conjectures about them. 



1. Introduction 

Consider the seven lines connecting opposite vertices of a cube and of its dual 
octahedron. Although the symmetry group docs not act transitively on the lines, 
they are exceedingly well distributed within MP^ . In this paper, we prove that they 
form a universally optimal configuration; in other words, they minimize a wide 
variety of natural notions of energy. Universal optima are rare, and we show that 
this configuration is the largest universal optimum in MP^. 

To prove universal optimality, we use semidefinite programming bounds, which are 
a powerful technique for proving bounds in coding theory. We give a new derivation 
of these bounds and extend them from coding to energy minimization. Proving 
universal optimality involves challenges that have not arisen in previous applications 
of semidefinite programming bounds, and wc provide a general methodology for 
meeting these challenges. Furthermore, we conjecture that in certain other cases our 
bounds remain sharp throughout a phase transition between two different ground 
states, which would be a remarkable phenomenon. We have not yet been able to 
prove these conjectures, but the techniques we introduce here represent the first 
steps in a program to do so. 

1.1. Background. How can one distribute N points as uniformly as possible in 
a compact metric space X with metric d? There are many possible answers, such 
as forming a good error-correcting code, i.e., maximizing the distance between the 
closest points. One particularly important family of answers generalizing coding 
theory is given by potential energy minimization. Given a decreasing, continuous 
function / : (O, max^^y^x d{x, y)^] — M called the potential function, define the 
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energy of a finite configuration C C X hy 

x,y£C 

Note that the use of squared distance instead of distance is not standard in physics, 
but it is mathematically convenient (and of course there is no loss of generality). 
We wish to choose C so as to minimize Ef(C) subject to \C\ — N . Because / is 
decreasing, this amounts to moving the points as far apart as possible, but all the 
distances matter, not just the minimal distance. 

This energy minimization problem arises naturally in physics, as the problem 
of determining the ground state of a classical particle system with isotropic pair 
interactions. Even in the case of particles confined to two-dimensional surfaces, these 
models capture important features of many real-world materials, such as colloidal 
particles adsorbing to the surface of a droplet (see |BG09| for details and other 
examples) . 

Small systems frequently display beautiful symmetry, such as twelve particles on 
a sphere forming the vertices of an icosahedron, but in larger systems the symmetry 
is often broken by the appearance of defects. One might expect that these defects 
would occur only in local minima for energy, and that the global minimum would 
have a perfect crystalline structure. Sometimes that is the case, but in many systems 
the defects actually contribute to lowering energy. (See for example the discussion 
of spherical crystals in Section 3 of |BG09| .) One of the fundamental problems in 
this area is understanding when highly symmetrical configurations are optimal and 
why ^ClO) . Even when the answer is easy to guess, it is usually not easy to prove, 
and guessing the answer can itself be tricky. 

Energy minimization can also be viewed as a generalization of coding theory. 
Specifically, it includes coding theory as a degenerate special case: if we take 
/(r) = 1/?"^, then as s — > oo, the problem of minimizing Ef{C) turns into the problem 
of maximizing the minimal distance in C. The same problem of understanding 
symmetry occurs here as well: when should one expect an optimal code to be highly 
symmetrical? Large or high-dimensional codes are frequently much less highly 
structured than small codes. 

Linear programming bounds are a powerful technique for proving lower bounds 
for energy. In the simplest cases, they deal with the distance distribution of the 
configuration (in physics terms, the radial pair correlation function), which counts 
how many times each distance occurs between pairs of points. In other words, the 
distance distribution of C is the function S: [0,oo) — > M defined by 

S{r) ^\{x,y eC : d{x, y) = r}\ . 

Clearly S{0) = |C|, S{r) > for aU r, and 'E^Sir) = |Cp (note that only finitely 
many terms are nonzero) , but we will see that these are far from the only constraints 
on 5. The distance distribution plays a key role because energy is a linear function 
of (5: 

r>Q 

By contrast, the underlying configuration C is not always determined by S (see 

EMl). 
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Delsarte |D72j realized that in addition to the obvious constraints on S mentioned 
above, there are often many other hnear constraints. Using these constraints, one 
can derive bounds on energy via hnear programming, because one is optimizing 
a hnear function of 5 subject to hnear constraints. This was first carried out for 
energy minimization by Yudin |Y92| . We refer to them as two-point bounds because 
they depend only on the distances between pairs of points. Linear programming 
bounds are generally not sharp, or even close to sharp, but in certain cases they 
are unexpectedly powerful. For example, Cohn and Kumar jCK07] used linear 
programming bounds to prove that a number of exceptional structures in spheres 
or projective spaces are universally optimal. In other words, these configurations 
minimize energy for all completely monotonic functions of squared Euclidean distance 
on spheres or squared chordal distance in projective space. (Recall that a completely 
monotonic function is a smooth, nonnegative function whose derivatives alternate in 
sign: it is decreasing, convex, etc. For example, inverse power laws are completely 
monotonic.) Examples include the vertices of any regular polytope with simplicial 
facets, the Eg root system, or the minimal vectors in the Leech lattice. 

It is surprising that linear programming bounds are ever sharp, because when 
they are sharp, pair distance information alone suffices to identify the true ground 
state. As one might expect, that is very rarely the case. To prove stronger bounds, 
it is natural to try to take into account triples as well as pairs (i.e., how many times 
each triangle of distances occurs between three points), and Schrijver |S05| found 
the right approach. The constraints are no longer linear, but rather semidefinite. 
Bachoc and Vallentin |EV08| developed a representation-theoretic explanation and 
extended the method from binary codes to spheres, with an approach that applies 
also to more general spaces (and this method was further developed by Musin M07 ). 
These three-point semidefinite programming bounds are one of the most powerful 
general tools known for proving bounds in coding theory. Using them, Bachoc and 
Vallentin lBV09bj determined the optimal 10-point code in S^, which was the first 
new optimality proof for a spherical code in decades. They also conjectured that the 
bounds are sharp for the optimal 8-point code in (a square antiprism, first proved 
optimal by Schiitte and van der Waerden |SW51| . so there was less motivation to 
verify the sharpness in this case). 

1.2. Our results. In this paper, we give a new derivation of the semidefinite 
programming bounds, which has the advantage of requiring no explicit calculation 
beyond what is necessary for the linear programming bounds. We then prove 
semidefinite bounds for potential energy minimization. Using three-point bounds, we 
prove universal optimality for a seven-point code in MP^. It is given by the seven lines 
through opposite vertices of a cube and its dual octahedron. Equivalently, the lines 
connect opposite vertices of a rhombic dodecahedron. In the dual picture of planes 
through the origin in , the planes are parallel to the facets of a cuboctahedron 
(the dual polyhedron to the rhombic dodecahedron). 

Theorem 1.1. The rhombic dodecahedron code is universally optimal in MP^. It 
is globally minimal for energy for each completely monotonic potential function of 
squared chordal distance, and it is the unique global minimum unless the potential 
function is a linear function. It is also the unique optimal seven-point code in MP^. 

It is straightforward to check that two-point bounds cannot prove Theorem 
It was already known that this configuration is an optimal projective code (i.e.. 
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it maximizes the minimal distance), as a consequence of the orthoplex bound of 
Conway, Hardin, and Sloane |CHS96j . Uniqueness was conjectured in |CHS96j but 
does not foUow from the orthoplex bound; see Appendix for an explanation. 

Note that the uniqueness asserted in Theorem |1.1| is as strong as possible: for 
linear potential functions, uniqueness fails because one can rotate the cube and 
octahedron relative to each other (so they are no longer in dual position) without 
changing the energy. That can be checked directly, but conceptually it holds because 
both polyhedra already define projective 1-designs. More generally, fc-designs 
automatically minimize potential energy for completely monotonic polynomials of 
degree at most k. 

One noteworthy aspect of the rhombic dodecahedron code is that it is the first 
case in which linear or semidefinite programming bounds are sharp for a code that is 
not distance regular. In other words, there is a distance such that the points in the 
code do not all have the same number of neighbors at that distance. All universal 
optima known or conjectured so far have been distance regular, so this example is 
the least regular one known. Theorem 1 1 . 1 1 thus extends the types of configurations 
that can be analyzed rigorously. 

We have searched extensively for other cases in which the three-point semidefinite 
programming bounds are sharp (in spheres, real projective spaces, and complex 
projective spaces), but the rhombic dodecahedron is the only new case we have 
found. It would be very surprising if it were the only example in projective space, 
and we think there must be others. However, our failure to find them suggests 
that they are either large or high-dimensional. In the process of searching for other 
examples, we have formulated several remarkable conjectures about semidefinite 



programming bounds (Conjectures 5.2 through 5.4): in the few sharp cases that 
are known, the bounds appear to remain sharp even all the way through a phase 
transition between two different structures. 

Theorem 1 1 . 1 1 provides the last remaining universal optimum in MP^: 

Theorem 1.2. The complete list of universally optimal line configurations in 
(up to isometry) is as follows: 

(1) Up to three orthogonal lines. 

(2) The four lines through opposite vertices of a cube. 

(3) The six lines through opposite vertices of an icosahedron. 

(4) The seven lines through opposite vertices of a rhombic dodecahedron. 

Case ([T]) is trivial, while cases ^ and ([3| follow from Theorem 8.2 in |CK07| . 
Case Q is Theorem and the completeness of the list is proved in Appendix |Bj 



1.3. Notation and definitions. A code is simply a finite subset of a metric space. 
It is optimal if it maximizes the minimal distance between points, given the number 
of points and the metric space. 

An (n, N, t) spherical code is a set of N points in the unit sphere S*""^ such that 
the maximal inner product between distinct points is at most t. (In other words, the 
minimal angle between them is at least cos~^ t.) An antipodal code is a spherical 
code that is closed under multiplication by —1. Real projective codes are of course 
equivalent to antipodal codes, and the rhombic dodecahedron code corresponds to a 
(3, 14, l/VS) antipodal code. (Note that the vertices of the rhombic dodecahedron 
do not lie on a common sphere, so they must be rescaled to form this code.) 
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The chordal metric is defined on real, complex, or quaternionic projective space 
as follows (in the octonionic case it is defined using the Frobenius norm on the 
exceptional Jordan algebra; see |CK07[ p. 130]). If we represent points in projective 
space by unit vectors, then the chordal distance between x and y is -^/l — |(a;, y)p, 
where {x, y) is the Hermitian inner product. This metric, first introduced in |CHS96| . 
is equivalent to the Fubini-Study metric but is in many ways more convenient. 

A code in a sphere or projective space is universally optimal if it minimizes the 
energy Ef for all completely monotonic potential functions / (compared to all other 
codes of the same size) , with the metric chosen to be the Euclidean metric on spheres 
or the chordal metric on projective spaces. Recall that distance is squared in the 
definition of Ef. This strengthens the notion of universal optimality, compared to 
defining it with unsquared distance, and it improves the connections with topics 
such as spherical or projective designs. (See |CK07i p. 101].) 

Let Pk{t) denote the degree k ultraspherical (i.e., Gegenbauer) polynomial for 
5'""^, normalized with P^(l) — 1. (This is not the most common notation.) These 
polynomials are orthogonal with respect to the measure (1 — t^)("~'^)/^(ii on [—1, 1]. 
See Chapters 6 and 9 of |AAR99| for more details. 

In the literature, two-point bounds are typically referred to as linear programming 
bounds and three-point bounds as semidefinite programming bounds. These names 
are a reasonable reference to the underlying inequalities, but we propose calling 
them "fc-point bounds," which focuses attention on the most geometrically relevant 
feature, namely the number of points being simultaneously considered. 

Given a matrix A, we write A ^ if A is Hermitian (i.e., it equals its own 
conjugate transpose) and positive semidefinite. The inner product on Hermitian 
matrices is defined by {A,B) = tr^AB) (i.e., the entry-by-entry Hermitian inner 
product), and angle brackets will also denote the usual inner product on vectors. 
We will use J to denote a matrix with all entries 1. 

When we deal with points in real projective space MP""^, we will represent each 
point by an arbitrary lift to the double cover S"~^. That introduces sign ambiguities, 
so we must ensure that all formulas are invariant under the choice of the lift. For 
example, if we apply a function to the inner product between two points, then it 
should be an even function. 

2. Linear and semidefinite programming bounds 

2.1. Positive-definite kernels. Linear and semidefinite programming bounds are 
based on the theory of positive-definite kernels. Let G be a topological group acting 
continuously on a topological space X. Call a continuous function K : X x AT — C a 
positive- definite kerne^if for all A'^ G N and all Xi, . . . , xjv S X, the N x N matrix 
{K{xi, Xj))i<ij<N is Hermitian and positive semidefinite. (Of course, saying it is 
Hermitian simply means that K{x, y) = K{y, x) for all x,y E X.) We call the kernel 
G-invariant if K{gx,gy) = K{x,y) for all g S G and x,y E X. 

Given any unitary representation V oi G with inner product (•, •) and any 
continuous map Lp: X -^V that is G-equivariant (i.e., (p{gx) = g^p{x) for g G G and 
X € X), setting K{x,y) = {(p{x) , ip{y)) defines a G-invariant positive-definite kernel 
on X. In fact, every such kernel arises in this way: 



In an ideal world, we would call K a positive-semidefinite kernel, but unfortunately this 
terminology is too well established to be easily changed. 
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Theorem 2.1. For every G-invariant positive-definite kernel K on X , there exists 
a unitary representation V of G and a continuous, G-equivariant map ip: X 
such that K{x, y) ~ {if{x), >f{y)) for all x,y Cz X . 

This theorem is a variant of a characterization of positive-definite kernels due 
to Bochner |B41| . See, for example, §5 of Chapter IV in [L85) for a closely related 
result (with the same idea behind it). 

Proof. Let W be the complex vector space formally spanned by the points of X, 
and define a form (•, •) on W by setting {x, y) = K{x, y) for x,y € X and extending 
linearly in the first coordinate and conjugate linearly in the second. The action of 
G on X extends to an action of G on 14^. 

Because K is a, G-invariant positive-definite kernel, the form (•, •) on W is G- 
invariant and positive semidefinite. Let Wq be the subspace of vectors that have 
norm 0. Then the inner product on W/Wq is positive definite, and the completion V 
of W/Wq is a Hilbert space. Let (p: X ^ V he the obvious map (the composition of 
the trivial embedding of X in W, the quotient map by Wq, and the embedding in the 
completion). Then ip is continuous and G-equivariant, and K{x,y) = {ip{x) , (p{y)) 
for all x,y & X. 

To complete the proof, we just need to verify that y is a unitary representation of 
G. In other words, for each v ^ V, the map g H> gv must be a continuous function 
from G to V. By G-invariance it suffices to verify continuity at g = 1. Thus, we wish 
to show that — >■ w as g — ?> 1. Furthermore, it suffices to show this convergence 
for a dense subset of V, and we choose W/Wq as that subset. Therefore, we can 
assume that 

xex 

where [x] denotes the vector in V corresponding to x £ X and where all but finitely 
many of the coefficients vanish. Then 

\gv-v\'^^ ^ c^c^{2K{x,y) - K{x,gy) - K{gx,y)). 

x.yeX 

By the continuity of K and the action of G on X, the finitely many terms on the 
right side with nonvanishing coefficients can be made arbitrarily small by making g 
close to the identity. Thus, V is indeed a unitary representation of G. □ 

If V splits into an orthogonal direct sum 1/ = 14 ffl V2 as a representation of 
G, then each corresponding kernel K splits as K — Ki + K2, where Ki and K2 
correspond to the subrepresentations Vi and V2. 

Suppose that G is compact. Then by the Peter- Weyl theorem, every unitary 
representation of G is an orthogonal direct sum of finite-dimensional irreducible 
representations. Thus, the cone of G-invariant positive-definite kernels is spanned 
by those arising from irreducible representations. 

When G acts transitively on X, we can identify X with G/H, where H is the 
stabilizer of a point e £ X. Then a G-equivariant map ip: X ^ V is completely 
determined by (p{e) via 'p{ge) = gip(e), and >p{e) can be any vector in V that is 
fixed by H. 

The simplest case is when (G, H) is a Gelfand pair: then the fixed space has 
dimension at most 1 when V is irreducible, so each irreducible representation is 
associated with at most one positive-definite kernel (up to scaling). This occurs, 
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for example, when X is a sphere, projective space, or Grassmannian and G is the 
group of isometrics of X. 

When the fixed space has dimension greater than 1, the situation is more 
compUcated. Then V is associated with several positive-definite kernels, but in fact 
there is a richer structure behind them. For Xi,X2 G X, define a sesquilinear form 
ii'a;i,x2 on as follows. Let a;, = gie with (71,32 G G, and for v\,V2 S define 

The key property of these forms is that for all N G N, xx,...,xn G X, and 
v\,...,VNe V" , 

l<i,j<N 

To prove this inequality, note that 



Kx,,xj{vi,Vj) = {giVi,gjVj) 

l<i,j<N l<i,j<N 



l<i<Ar 



hVi 



As a consequence (by taking vi = ■ ■ ■ = vn), for all S N and Xi, . . . , Xn & X, 

l<i,j<JV 

where means that A is positivc-scmidcfinitc and Hcrmitian. The G-invariant 
positive-definite kernels are given by (x,y) Kx,y{v,v) with v € , but the 
underlying structure of K is of broader importance. 

In general, define a matrix- valued positive-definite kernel to be a map that takes 
x,y € X to a, sesquilinear form K^.y on a fixed complex vector space W (not 
necessarily finite-dimensional), with the following properties: for all v,w G W, the 
map {x,y) i-)- Kx,y{v,w) is continuous, for all x,y G X and v,w G W, 

and for all A' e N and all a;i, . . . , xn € X and wi,. . . , wn G W, 

l<i,j<N 

The last two properties are equivalent to requiring that 

We say K is defined on X and over W. The matrix-valued kernel is G-invariant 
if Kgx^gy = Kx^y for all G G and x,y G X. The construction above defines a G- 
invariant matrix-valued positive-definite kernel over whenever G acts transitively 
on X. Note that when dim W = 1, a matrix- valued positive-definite kernel over W 
is the same as an ordinary (scalar-valued) kernel. 

When G does not act transitively on X, the identification of the vector space 
Home (A, V) of continuous, G-equivariant maps with breaks down. Nevertheless, 
essentially the same construction works. One can define a G-invariant matrix- valued 
positive-definite kernel over Hom^ {X, V) by setting 

Kx^^X2{'^l,'^2) = {Lpi{xi),ip2{x2)) 

for Xi,X2G X and </'i)<^2 S Hom(3(X, y). This construction is fully general, aside 
from changing variables in trivial ways: 
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Theorem 2.2. For every G-invariant matrix-valued positive-definite kernel K on 
X and over W , there exists a unitary representation V of G and a linear function 
ip: W ^ ilomQ{X, V) (written w M- ip^) such that for all x,y ^ X and u, w G W , 

K^^y{v,w) = {ip^{x),ipw{y)). 

Proof. Let U = W (S)c CX, where CX denotes the complex vector space formaUy 
spanned by the points of X. We define a form (•, •) on f7 by 

('^Vi®Xi,'^Wj(^Xj) = ^ K^^^^. {vi,vuj) 

\ i j I ij 

for Vi,Wi £ W and Xi e X. Because is a matrix-valued positive-definite kernel, 
this defines a Hermitian, positive-semidefinite form on U. Let Uq denote the subspace 
of vectors with norm 0, and let V be the completion of U/Uq, so F is a Hilbert 
space. Define ^w{x) to be the image ofw(^xmV. 

The Hilbert space V is a unitary representation of V, with the trivial action of 
G on and the usual action on X. (Strong continuity follows as in the proof of 



Theorem 2.1 ) By construction, 

K^^y{v,w) = {(p^{x),(fnj{y)), 
which completes the proof. □ 

When X = S*"^^ and G is the stabilizer in 0{n) of a point in X, the matrices 
constructed in Theorem 3.1 of [BV08J (by a different method from that used here) 
define matrix-valued positive-definite kernels, and we were led to our construction in 
an attempt to develop a more abstract approach to Bachoc and Vallentin's discovery. 

2.2. Linear and semidefinite programming bounds. Given the machinery of 
positive-definite kernels, it is easy to write down linear and semidefinite programming 
bounds for energy minimization. We will write X'^/G for the set of orbits of G 
acting on pairs, and [x,y] will denote the orbit of the pair {x,y). 

The linear programming bounds involve linear constraints on the G-invariant 
pair distribution of a code. Given a finite subset C C X, define 

^x.,v = \{{u,v) e : [u,v] = [x,y]}\. 

These numbers satisfy some obvious linear constraints. They are all nonnegative, 
and their sum over the set X'^/G of orbits is |Cp. Furthermore, A^.x = \C\ if G acts 
transitively on X, and even if that is not the case, the diagonal terms A^.x are the 
orbit sizes of G acting on X. 

In addition to these obvious constraints, each G-invariant positive- definite kernel 
K : X X X C gives a more subtle constraint. Specifically, 

J2 Ax^yK{x, y)^ ^ 0. 

[x,y]eX^/G x,veC 

Of course, the sum over [x, y] G X'^ /G means a sum over representatives of the orbits. 
Positive-definite kernels are remarkable because they have this property despite 
typically being negative at many points. Pfender |P07j discovered that they are not 
the only such functions, and one can occasionally improve the linear programming 
bounds by incorporating his functions. Unfortunately the improvement seems to 
be small in general, and the method becomes much less systematic (because the 
representation-theoretic context is lost), so we will not use Pfender's approach here. 
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Given a G-invariant potential function / : X x X — > M, define the energy of C by 

x,y€C lx,y]eX^/G 

Potential energy is a linear functional of the pair distribution A, so by solving a 
linear programming problem (possibly infinite dimensional) one can optimize it 
subject to the linear constraints described above. To prove a lower bound for energy, 
valid for arbitrary sets of \C\ points in X, one need only find a feasible point in the 
dual linear program. 

When X — G/H and (G, H) is a Gelfand pair, linear programming bounds 
express all systematically available information about the pair distribution. In 
more general cases, linear programming bounds can be generalized to two-point 
semidefinite programming bounds. They work the same way, except they use the 



matrix- valued positive-definite kernels K^ y developed in Subsection 2.1 For each 
such kernel. 



[x,y\£X^/G 



A-x.y^x.y CZ 0, 



which is a semidefinite constraint on A. Thus, optimizing energy subject to these 
constraints becomes a semidefinite programming problem. 

2.3. Three-point bounds and beyond. The most important application of semi- 
definite programming bounds is to prove three-point bounds (as in |BV08| ). Fix a 
point e G X and let H = Stab(3(e) be its stabilizer. The semidefinite programming 
bounds give constraints on _ff-invariant pair distributions on X, and symmetrizing 
e with the other two points turns them into constraints on G-invariant triple distri- 
butions. One can then attempt to optimize energy (or other quantities) subject to 
these constraints. 

Of course, there is nothing sacred about three points, and one can prove fc-point 
bounds in the same way. (Musin |M07| was the first to formulate these bounds for 
spherical codes.) However, as k increases the bounds become increasingly difficult 
to compute with. The difficulty is that fc-point distributions are functions on X'' /G, 
so /c-point bounds involve optimization over functions on this space. When X'^ /G is 
one-dimensional, the optimization problem is usually doable, but it rapidly becomes 
intractable as the dimension increases. 

For a concrete example, suppose X — 5"^^ and G — 0{n). Then elements of 
X^ /G are determined by their pairwise distances, so dimX'^/G = (2). For fc = 3, 
functions of three variables are barely tractable, but for A: = 4, functions of six 
variables are too complicated: the space of polynomials of degree at most m in 
six variables has dimension (™fj^^) , which grows too rapidly to allow for extensive 
computations. 

The situation can be much worse for other spaces. For example, a pair of points 
in CP"~^ is determined up to isometrics by the distance between them, but for 
triples of points there is a fourth parameter, namely a complex phase change. (Two 
unit vectors in C" can easily be phase shifted so that their inner product is real, but 
for three vectors that is generally impossible.) The Grassmannian of fc-dimensional 
subspaces in M" is even worse, since min(fc, n — k) parameters are required to 
determine a pair up to isometrics (see, for example, |B06j ). 
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2.4. Explicit computations. Of course, to apply any of these bounds in practice, 
one must carry out the representation-theoretic computations explicitly. Fortunately, 
in the cases of interest in this paper, no more calculations are needed to set up 
fc-point bounds than two-point bounds. 

Recall that for the sphere 5"^^, it is a theorem of Schoenberg [S42] that 
the cone of 0(n)-invariant positive-definite kernels is spanned by the functions 
{x,y) P^{{x,y)). Here, PJ} denotes the fc-th degree Gegenbauer polynomial for 
5'""^. See Subsection 2.2 of |CK07| for a brief account of these calculations. In 
terms of Theorem |2.1[ these positive-definite kernels correspond to the irreducible 
representations of 0{n) that contain a nonzero vector fixed by 0{n — 1) (and the 
other irreducible representations do not have corresponding kernels). 

Once this characterization of positive-definite kernels is known, one can extend 
it to fc-point bounds without needing any additional representation theory or 
information about special functions. For example, three-point bounds can be 
handled as follows. This gives a new proof of Corollary 3.5 in |BV08| . 

Theorem 2.3. Let H be the stabilizer in 0{n) of a point e in S"~^ . For each 
k > 0, there is an H -invariant matrix-valued positive-definite kernel on S"'~^ that 
takes Xi,X2 G S''^^^ to the infinite matrix whose ii,i2 entry (indexed starting with 
Q) is 



where Uj — (e, Xj) and t — {xi, X2) ■ 



Of course, for numerical computations one uses finite submatrices of these infinite 
matrices. 



Proof. Let X — S*"" , with e and H as in Theorem 2.3 We must compute 
matrix- valued positive-definite kernels on the vector space Hom^/ (X, V) , where V 
is an irreducible unitary representation of H. Of course, Hom//(X, T^) is infinite 
dimensional, but we will choose a basis for a dense subset. 
Let 

r = {ye5"-i:(e,y)=0} 

be the equator relative to e. To specify a map ip S Hom//(X, y), we just need 
to specify its restriction to the equator and to all the parallel slices of the sphere. 



Fortunately, that is relatively manageable. As in Subsection 2.1 Hom//(y, V) = 
because H acts transitively on Y, and the Gelfand pair property implies that 
is at most one-dimensional; the same is true for all the parallel slices. Thus, the 
map if is determined by specifying a scaling factor on each slice (in other words, a 
function of one variable). 

Suppose dimV^^ = 1, and choose an element (p S llomH{Y,V) that is not 
identically zero. We will use (p to define ip on the equator Y. To complete the 
determination of p, we must specify the scaling factor on each parallel slice. We 
will use scaling factors that are polynomials in the inner product with e (up to 
multiplication by some fixed function / to be specified later). Specifically, for 
— 1 < M < 1 and y ^ Y, define 

iPi{ue + \/l - u'^y) = u'f{u)(p{y). 
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The function ipi is i?-equivariant because ip is, and as i varies these functions are 
dense in Hom/f(X, V). 

To compute the matrix-valued positive-definite kernel, we need to compute 

('/'ii (2^2), <y3i2 (0:2)) for xi,X2 S X. If we write Xj = Uje+ ^ 1 — uj yj , then 

{iPiAx2),fi2ix2)) = U-^U^ f{ui)f{u2) (<^l(yi),(p2(2/2)). 

The inner product ((^i(yi), (^2(2/2)) can be computed using Schoenberg's theorem, 
because y is a sphere and H is the full isometry group of Y . Specifically, there is 
some k (depending on V) such that 

('^i(2/i),^2(y2)> -Pr'((2/i:y2)), 

if (p is rescaled appropriately, which we can assume. 

If we let t ~ {xi, X2), then 

t-UiU2 

V(l-«i)(l-"2) 

It follows that the ii, 12 entry (95^^ {X2), V>i2 {^2)) of the matrix-valued positive-definite 
kernel is 

<4v(^^i)/(«2)pr^' ^ 



^{l-ul){l-ul))- 
It is convenient to choose f{u) = (1 — v?)^/"^ . Then 

((^,:,(X2), (0:2)) = ((1 - u\){l - 1 ' ^ " "^"^ 



y(i-«2)(i_^2)y 

This choice has the advantage that the right side is a polynomial in ui, U2, and t 
(because is an even or odd function, according as k is even or odd). □ 

The same approach works straightforwardly for /c-point bounds: choosing H to 
be the stabilizer of several points recovers the results of Musin [M07j . It also works 
for projective spaces. As in the case of spheres, the calculations for an n-dimensional 
projective space with respect to the stabilizer of a point reduce immediately to those 
for an (n — l)-dimensional space with respect to the full group. 

The calculations in real projective space are almost the same as those in the 



sphere. If we lift points arbitrarily to the sphere (as discussed in Subsection 1.3), 



then we just need to avoid any sign ambiguity. Specifically, in terms of the three 



inner products ui, U2, and t from Theorem 2.3 we want only terms that are invariant 



under changing the signs of two of the three variables. That means we take only 
the entries for which «i = 12 = fc (mod 2). This submatrix defines a matrix- valued 
positive- definite kernel on real projective space. 



3. Three-point bounds for energy minimization 

In this section, we write down three-point bounds for energy minimization in 
spheres or real projective spaces. We begin with spheres, after which we can easily 
adapt the answer to real projective spaces. All of the required theory is in Section [2j 
but there are a number of details that must be worked out carefully. 
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It will be convenient to write potential functions in terms of the inner product. 
Given a potential function /: [—1, 1) — )• R, define 



2 

Because \x-y\^ = 2- 2{x,y), if we set f{t) = g{2 - 2t), then Ef{C) = Eg{C). The 
function / is absolutely monotonic (i.e., it is infinitely differentiable and all of its 
derivatives are nonnegative) if and only if g is completely monotonic. Thus, C is 
universally optimal if and only if it minimizes Ef for all absolutely monotonic /. 

Given a configuration C C 5""^ with |C| = TV > 2, define the corresponding 
triple distribution by 

Au,v,t = y, z) gC^ : {x, y) = u, {y, z) = v, {z, x) = t}\. 

This function counts ordered triples of points modulo the action of 0{n). Of course, 
Au,v,t > 0, and it equals zero unless —l<u,v,t< 1 and 

1 + 2uvt -u^ -v^ -t'^ >0. 

In other words, the 3x3 Gram matrix 




must be positive semidcfinite. Furthermore, Au,v,t is symmetric in u, v, and t. It 
satisfies the identities Ai n = N , 



^ Au,v,t = 

u,v,t 

and 



v,t 

Define 

D = {{u,v,t) : -1 < u,v,t < I &nd 1 + 2uvt -u^ -v^ -t^> 0}. 

Because the potential function is not necessarily defined at inner product 1, it will 
be important to work with sums over D. For example, 

{u,v,t)£D u,v,t u 

= N^ - + 2N 
= N{N -1){N -2). 

Equivalently, summing over D counts ordered triples of distinct points. 
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We can express Ef{C) as a sum over D by writing 



(C) = ^^A„,„,i/(u) 

U<1 

^ Au,v,tf{u) 



2N 

U<l,V,t 



2^ (2E^"^".i/(")+ E 

\ «<i (u,'u,t)e-D / 

\ iu,v,t)eD J 



It follows that 



(3.1) EfiC)^—— J2 + + /(<)). 

^ ' {u,v,t)eD 

Let S'^{u,v,t) be the infinite matrix whose i,j entry (indexed starting with 0) is 
the symmetrization of 

in u, V, and t (i.e., the average over all permutations of the variables). The sum of 
this matrix over the code is positive semidefinite by Theorem |2.3[ in terms of Au.v,t, 

If we break the sum up according to which variables equal 1, we find that 
Y Au,.,tSi:{u, v,t) + 3Y Au,u,iSUu, u, 1) + NS'^il, 1, 1) h 0. 

(u,v,t)eD u<l 

(It might appear that S^{u,v,t) is undefined when one of the variables equals 1, 
but in fact all its entries are polynomials in u, v, and t.) The same trick as we used 
for Ef shows that X]tt<i Au,u,iSJ^{u,u, 1) equals 

^ ' {u,v,t)<£D 



(In fact, this is simply (3.1 ) with f{u) replaced with 2S''^{u, u, 1).) Thus, if we define 
T^{u, V, t) = {N- 2)51? (u, V, t) + S'^iu, u, 1) + S^v, v, 1) + Sl{t, t, 1), 

then 

N{N ~ 2) S'^ {1, 1,1)+ Y A^,v,tT^[u,v,t)>G. 

(u,v,t)£D 

Furthermore, note that '5'^(1, 1, 1) is the zero matrix unless fc = 0, in which case all 
of its entries are 1. If we let J denote the all 1 matrix, then our inequality becomes 

N{N-2)5kfiJ+ E Au^v,tTj:{u,v,t)>0, 
{u,v,t}eD 

where S denotes the Kroneckcr delta function. 
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Theorem 3.1. The minimal value of Ef for N > 2 points in S" ^ is greater than 
or equal to the optimum of the following semidefinite program: minimize 

— i— E + /(-) + / W) 

over all choices of A^ y t > for {u,v,t) S D, subject to 

E A„,„^*=iV(iV-l)(iV-2) 

(«,D,t)G-D 

and 

N{N~2)Sk,oJ+ E Au,y,tT^{u,v,t)hO 
(u,v,t)eD 

for k>0. 

To compute a lower bound for energy, we will use the dual semidefinite program. 
Suppose we define a function _ff on D by 

H{u,v,t) ^ c + J2{Fk,Tj:iu,v,t)), 

fc>0 

where c is a constant, Fk denotes an infinite symmetric matrix, and the inner product 
on symmetric matrices is the trace of the product. We assume only finitely many 
entries of Fk are nonzero, so the inner product is well defined. 

Theorem 3.2. With the notation established above, if Fk ^ for all k and 

Hiu,v,t)<M±m±m 

3 

for all {u,v,t) £ D, then the minimal value of Ef for N points in S"~^ is at least 

N 

-((iV~l)c-(Fo,J)). 
Proof. For any iV-point code C with triple distribution A, 

E A^^y^tHiu,v,t) <2{N-2)Ef{C) 

{u,v,t)£D 



by (3.1 1. On the other hand, the inner product of two positive semidefinite matrices 
is nonnegative, from which it follows that 

N{N ~2){Fk,J)5k,o+ J2 Au,,,t{Fk,Tl\u,v,t))>0. 

Summing over k yields 

N{N-2){Fo,J)+ E A^,v^t{Hiu,v,t)^c)>0, 
{u,v,t)eD 

and combining these inequalities with J2{u v t)eD ^y^v.t = N{N—l){N—2) completes 
the proof. □ 

As discussed at the end of Section [2j the real projective case is almost identical. If 
/ is an even function, then we can lift points arbitrarily to S"~^ without introducing 
any ambiguity in i?/. It will prove convenient to use a third variant E of the notation 
for energy (in addition to E and E); that may seem unnecessary, but E has the 
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clearest connection to physics, E is the most convenient for spheres, and E is the 
most convenient for real projective space. Given a potential function /: [0, 1) — s- M, 
define 

1 

2 ^ 

x,yeC 

Here x and y represent lifts to 5""^ of points in ]RP"~^. Note that each point in 
jjpn-i j^g^g but we only include one of them in the sum (chosen arbitrarily). 

Let be the submatrix of S*^ indexed by numbers with the same parity as k, 
and define 

f^iu, V, t)^iN- 2)5," (u, V, t) + SUu, u, 1) + SUv, v, 1) + 5^(t, t, 1). 

Then the three-point energy bounds for EP"~^ are exactly the same as those for 
5"^^, except with S and T replacing S and T: 

Theorem 3.3. // 

H{u,v,t)^c + ^{Fk,f;:iu,v,t)), 

k>0 

with Fk y for all k and 

H{u,v,t) < ■'-^ 

o 

for all {u,v,t) G D, then the minimal value of Ej for N points in MF"~^ is at least 

N 

-((iV-l)c-(Fo,J)). 



The choice of c and Fa,Fi, . . . in Theorems |3.2| and |3.3| can be optimized using 
semidefinite programming. First, assume Fi^ — for k beyond some bound, and that 
for all k the i, j entry of Fk vanishes if i and j are sufficiently large, so that only 
finitely many variables need to be considered. Assume also that / is a polynomial 
(although that assumption can be relaxed, at the cost of additional complications). 
To express the constraint that 

H{u,v,t) < ^'^^ ■'^^ 

3 

for (m, V, t) e Z?, we will use a sum of squares representation due to Putinar. 
Unfortunately, there is a nontrivial error in the paper [P93j (see |M08[ p. 98]), so 
we must be careful here. The space D is defined by the constraints 1 — > 0, 
1 — w'^ > 0, 1 — > 0, and 1 + 2uvt — — — t^ > 0, and this representation 
of D is stably compact (i.e., it defines a compact set even if the coefficients are 
perturbed); furthermore, only the last constraint has odd degree. It follows from 
Corollary 7.2.5 in [MOSj that if a polynomial is strictly positive on D, then it can 
be expressed in the form 

Ga{u,v,t) 

+ Gi(u, V, t){l - u^) + G2{u, V, t)(l - v^) + Gsiu, v, t){l - t^) 

+ G4u, V, t){l + 2uvt -u^ -v^ - t^), 

where Gq{u, v,t), . . . , Gi{u, v, t) are sums of squares of polynomials in u, v, t. A 
polynomial is a sum of squares if and only if it is of the form {z{u, v, t), Mz{u, v, i)). 
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where M is a symmetric, positive-semidefinite matrix and z{u,v,t) is a vector 
whose entries are monomials in u,v^t; thus, being a sum of squares is a semidefinite 
condition. If we apply this approach to 

o 

then we get semidefinite programs that are guaranteed to come arbitrarily close to 
the true optimum in Theorem |3.2[ Of course, we hope they will actually achieve 
the true optimum, and in practice that occurs. 



4. Proof of universal optimality 

In this section we prove Theorem There are two fundamental difficulties. 
One is that, although we can solve a semidefinite program numerically to obtain a 
bound for any given potential function, the solutions are very cumbersome and it is 
not easy to produce a rigorous, exact solution. Bachoc and Vallentin dealt with this 
difficulty in [B V09b| . but their problem is substantially simpler and they employed 
ad hoc techniques. We will develop a more systematic approach. 

The second difficulty is that to prove universal optimality, we must prove optimal- 
ity for infinitely many different potential functions, namely all completely monotonic 
functions. Of course we only need to deal with the extreme rays in this cone, but 
there are infinitely many of them as well. To address this difficulty, we will construct 
a finite set of functions (not all completely monotonic) such that optimality for 
all of them implies universal optimality. In other words, we will replace the cone 
of completely monotonic functions with a larger cone that has only finitely many 
extreme rays, so we will prove a slightly stronger result than universal optimality. 

Recall that 

x,yeC 

where x and y represent lifts to of points in MP"~^ (with only one lift of each 

point being used) . One advantage of squaring the inner product is that it becomes 
invariant under sign changes, but it also relates well to the chordal distance. The 
squared chordal distance between x and y is 1 — (x,j/)^, so Ef{C) = Eg(C) with 
g(t) = /(I — t). Thus, C is universally optimal in MP"^^ if and only if it minimizes 
Ef for each absolutely monotonic function /. The cone of absolutely monotonic 
functions on [0, 1) is spanned by the monomials f{t) = t' (see Theorem 9b in [W411 
p. 154]), so it suffices to prove optimality for these monomials. 

To reduce proving universal optimality to a finite problem, we will apply Hermite 
interpolation. Recall that given a nonempty, finite multiset T of points in M (with 
the multiplicity of i G T denoted multy(t)), the Hermite interpolation Hrif) of a 
function / is the unique polynomial of degree less than 

^multT(t) 

that agrees with / to order multT(i) at each t G T. (I.e., f'^^{t) = {'HtU))''^ {t) 
for all i < multr(t).) See Subsection 2.1 of |CK07j for background on Hermite 
interpolation. The following observation of Yudin will be crucial: 
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Lemma 4.1 (Yudin |Y92| ). Let T be a finite, nonempty multisubset of an interval 
I such that each point in T has even multiplicity, except for the endpoints of I 
(which are allowed to have even or odd multiplicity). For each absolutely monotonic 
function /:/—>■ M, 

nrifm < fit) 

for all t (z I . 



Lemma 4.1 follows from the remainder formula for Hermite interpolation (see, 



for example, Lemmas 2.1 and 5.1 in [CK07j for a proof). 

Lemma 4.2. Let T — {ti, . . . ,tM} &e a nonempty multisubset of an interval I 
(written with ti repeated according to its multiplicity). If f : L ^ R is absolutely 
monotonic, then T-Lrif) is a nonnegative linear combination of the partial products 

m 

t^Y[{t-t,) 

for < m < M . (Of course, the m — case is the constant function 1.) 



What Lemma [4~2] says depends on the ordering oi ti, . . . ,tM, but it is true for 
every ordering. 



Proof. We prove Lemma 4.2 by induction on M. For A/ = 1, Hxif) is the constant 
function f(ti) and the lemma is trivial. Otherwise, let T' = {^2, ■ • • j^a/}- Then 

(4.1) (HT(/))(t) - /(ti) + (t - ti)(HT'(g))W, 

where g{t) — [f (t) — f (ti)) / {t — ti) (and g{ti) = f'{ti)). The function g is absolutely 
monotonic on /, by Proposition 2.2 in |CK07| : alternatively, that can be seen directly 
via 

d" ff{t)-fiti)\ _ Slf^''+'\u){u-t,fdu 



dt'' V t-h J (t-ti)'=+i 

which is the fundamental theorem of calculus for fc = and can be proved by 
induction (or by using a Taylor series expansion for {f{t) — f(ti))/(t — ti) about 
ti). Now applying the lemma inductively to Hr'^g) and using (4.1) completes the 
proof. □ 



Combining Lemmas |4.1| and |4.2| reduces proving universal optimality to a finite 
number of cases, as follows. 

Corollary 4.3. Let C be a finite subset o/MP"^^ (represented via arbitrary lifts 
to S'^-^), and let T = {ti, . . . , tM} he any finite multisubset of [0, 1) (written with 
multiplicities) such that all elements other than have even multiplicity and 

{{x,y)'^ : x,y eC, X ^y} CT. 

If C minimizes Ef^ for the potential functions 

m 

U{t)^l[it-t,) 

1=1 

with < TO < M, then C is universally optimal in MP"^^. 
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Proof. Let /: [0, 1) — > M be absolutely monotoiiic. By Lemma 4.1 / > Hrif) on 



[0, 1), and by Lemma 4.2 there are nonnegative coefficients Ao, . . . , Am-i such that 

'^t(/) = Ao/o + • ■ • + Aa/-i/j\/-i- 
Thus, for any configuration V with |2?| ~ |C|, 

= XaEf,{V) + ■ ■ ■ + \M-iEf,,_,{V) 
>XoEf„{C) + --- + XM-iEf,,_,iC) 

= Ef{C), 

where the last equation holds because / = ^t(/) at each squared inner product 
between distinct points in C. □ 

Unfortunately, even if C is universally optimal, there is no guarantee that it is 
optimal for the potential functions constructed in Corollary |4.3| That seems to 
depend on luck and the proper choice of T. 

The proof of the main theorem in |CK07] can be recast in this framework, which 
simplifies the argument given there: specifically, it replaces the use of conductivity. 
As a side benefit, this approach allows us to give a substantially simpler proof of the 
universal optimality of the regular 600-cell in than was given in |CK07] . (That is 
the one case that was not proved by a conceptual argument, but rather by somewhat 
complicated calculations.) See Appendix [C] for more details. 

For the rhombic dodecahedron code in MP^, the squared inner products are 0, 



1/9, and 1/3. If we take T = {0,0,0, 1/9, 1/9, 1/3, 1/3} in Corollary |4J we find 
that it suffices to prove optimality for the potential functions f{t) = 1, t, t^, t^, 
t^(t - 1/9), t^{t - 1/9)2, and t^(t - 1/9) ^(t - 1/3). Of course the constant function 
is trivial, and two-point bounds are sharp for f{t) = t (the rhombic dodecahedron 
code is a projective 1-design). Thus, only five cases remain. 

The triple repetition of in T is not essential for the proof, but it is helpful. 
Our approach fails if we take multy(O) — 1; with multT(O) = 2, it works but the 
numerical calculations are more cumbersome. 

For the rhombic dodecahedron code C, 

^/(C)= 3/(0) + 6/(1/9) + 12/(1/3). 

Thus, we wish to prove a lower bound of 3/(0) + 6/(1/9) + 12/(1/3) for energy 
with respect to each of the potential functions f{t) — t"^ , , t^{t — 1/9), t^{t — 1/9) 2, 
and t^{t — l/9)2(i — 1/3). In fact, in each case we prove something stronger, namely 
that the same lower bound holds not just for the potential function f{t) but also for 

m-m ■ 

Because fo{t) < f{t) for all i > 0, we have Ef^ < Ej, and equality holds for the 
rhombic dodecahedron code. 

For each potential function, we use an auxiliary function 

5 

H{u,v,t) = c + Y,{Fk,fl}{u,v,t)), 

k=0 
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where the matrix Fq is 5 x 5 (i.e., all entries outside of the upper left 5x5 block are 
zero), Fi and F2 are 4 x 4, F3 and F4 are 3x3, and F5 is 2 x 2. We then optimize 
the bound obtainable from Theorem 13.31 To enforce the constraint that 

3 

we write 

o 

where Af ^ is a 120 x 120 matrix and z{u,v,t) is the vector consisting of all 
monomials in u,v,t of degree at most 7. Note that this condition is very strong 
(for example, it forces the degree of the left side to be at most 14 and it forces the 
inequahty to hold for all {u,v,t) S R^), but in fact it works. 

The program SDPA-GMP can do arbitrary-precision semidefinite programming 
|N-|-08| , and CSDP can do sufficiently high-precision semidefinite programming for 
our purposes |B99j . Using either of them, one can solve this semidefinite program 
numerically and verify that the bound is nearly sharp. However, getting a rigorous 
proof takes more work. We would like to round an approximate solution to get an 
exact solution, but the rounding process can violate the constraints at locations 
where there is equality, so it must be done carefully. 

To get a sharp bound, we will impose two types of conditions. First, there are 
necessary constraints on the matrices Fk from complementary slackness (i.e., the 
conditions under which equality can hold in the proof of Theorem 3.3). Specifically, 
for < fc < 5, the inner product of Fk with 



35Sk,oJ + 6T| (0, 0, 0) + 24T^3 



111 

3'~3'"3 



must vanish. The coefficients and arguments of come from the triple distribution 
of the rhombic dodecahedron code. Second, we require that H(u,v,t) must equal 
(/o(w^) + fo{v^) + fo{t'^)) /3 at the triples of inner products that occur, namely 
the five arguments to in the formula above, and all their first partial derivatives 
must agree as well. These conditions are linear in the variables c, fo, . . . , ^5, and 
M. 

To perform the rounding correctly, we write the problem in a basis for the set of 
all c, Fq, ■ . ■ , F^, and M that satisfy the constraints listed in the previous paragraph, 
together with the defining equations such as 

— ^ — ^ — ^ — - — H [u, V, t) — {z{u, v,t),Mz{u,v,t)). 

3 

We then solve the semidefinite program numerically for the coefficients in this 
basis, and we round the coefficients to eight or nine decimal places to get an exact 
solution. It is not guaranteed that this rounding process will work: the problem 
is the semidefinite constraints, because unexpected zero eigenvalues may become 
negative due to the rounding. However, we do not run into that difficulty, because 
there turn out to be no zero eigenvalues except for the ones forced by the constraints 
we have built into our basis. Note that the exact auxiliary function is far from 
unique, and our rounding strategy makes essential use of this freedom. 
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Using this method in each of the five remaining cases proves optimality in 
Theorem |1.1| Carrying out the calculations requires a computer, most notably 
because M is a 120 x 120 matrix, but all the calculations are completely rigorous. In 
particular, we use exact rational arithmetic, with no floating-point approximations. 
See Appendix [P] for more details. 

To complete the proof, we must verify uniqueness. The Hermite interpolation 
at more than two points (counting with multiplicity) of an absolutely monotonic 
polynomial cannot be linear unless the original polynomial is itself linear: otherwise, 
by RoUe's theorem the second derivative of their difference would have a root 
between the points, and hence the second derivative of the original function would 
vanish, which would violate absolute monotonicity. Thus, to prove uniqueness for 
nonlinear potential functions, we just need to prove it for the five cases f(t) = i^, 
t^, t^{t - 1/9), t^{t - 1/9)2, and t^{t - l/^Y{t - 1/3). Let f{t) be one of these, and 
let c, Fq, . . . , F5 and M be as above (for the modified function /o). 

Because f > fo pointwise, 

with equality only when m^, z;^, t'^ e {0, 1/9, 1/3}. Thus, there are only finitely many 
points at which equality is possible, and we can simply check all the cases. For 
{u,v,t) G -D, the triple (u, u,t) can occur in the global minimum for energy only 
if equality holds (otherwise, something is lost in the inequalities in the proof of 
Theorem |3.3[ ) . It follows that the only triples in D that can occur in an optimal 
configuration are (0, 0, 0), (-1/3, -1/3, -1/3), (-1/3, 1/a/3, 1/a/3), (1/^3, l/V^, 0), 
and (1/3, l/-\/3, l/-\/3), up to equivalence defined by permutations of the coordinates 
and pairs of sign changes. Let Ni, . . . , N5 denote the numbers of times each 
equivalence class occurs between a triple of distinct points, with the numbering from 
1 to 5 in the order given here. 

The numbers iVi, . . . ,iV5 must satisfy several linear constraints. Their sum must 
be 210, and for < fc < 5, the inner product of with 



354,0 J + Nif^ (0, 0, 0) + N2fi 



111 

3'^3'"3 



must vanish, for the same complementary slackness reason as above. If we solve 
these simultaneous equations, we find that A^i = 6, N2 — 24, — 36, A'4 — 72, 
and TVs = 72. 

Because A^i — 6, there is a unique triple of orthogonal points, up to permutations. 
Call them ei, 62, 63 (as usual, we deal with arbitrary lifts to the unit sphere). The 
only possible inner products between points in the code are 0, ±1/^/3, ±1/3, and 
±1, so every point other than ei, 62, 63 must have inner product 0, ±l/\/3, or ±1/3 
with each Ci (note that —1 is ruled out by the uniqueness of 61,62,63). Because 
ei, 62, 63 are orthogonal unit vectors, the sum of the squares of the inner products 
with them must be 1. That means they must all be ±l/-\/3. In other words, with 
respect to the orthonormal basis 61,62,63, the code consists of the basis vectors 
together with four points in the cube (±l/-\/3, ±l/-\/3, ±1/a/3)- It follows that it is 
the rhombic dodecahedron code. 
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Finally, for uniqueness as a spherical code, we use f{t) — t^[t ~ l/9)^(t — 1/3). 
Every code whose minimal distance is as large as the rhombic dodecahedron code's 
minimal distance has Ef < 0, and hence it minimizes Ef (because the minimal 
energy is 0). Thus, uniqueness for this potential function implies uniqueness as a 
spherical code. This completes the proof of Theorem 

5. Open questions 

5.1. Three-point bounds and generalizations. One pressing question is why 
three-point bounds are not sharp more often. Of course, it is unreasonable to expect 
that sharp bounds will ever be common in packing or energy minimization problems. 
The phenomena seem to be intrinsically complicated (see, for example, |B-|-09| ), 
and the most one can hope for is to prove optimality in exceptional cases. However, 
even just for spheres, the two-point bounds are sharp for three infinite families and 
a dozen sporadic cases (see Table 1 in |CK07| ). and they are sharp even more often 
in projective spaces. Thus, it is surprising that only three sharp cases are known 
for three-point bounds, excluding the cases where two-point bounds already suffice. 
Surely there must be more examples, but so far we have not been able to find them. 

It is natural to ask what sort of bounds are required to prove optimality for 
an iV-point configuration. Of course TV-point bounds suffice, but only because 
in the process of optimizing over A'^-point distributions they implicitly optimize 
over all A'^-point configurations. For what sorts of families of configurations might 
o(iV)-point bounds suffice? 

It would also be very interesting to do explicit calculations with four-point 
bounds. This project would involve exceptionally time-consuming calculations, 
but it is possible in principle and perhaps in practice. Gijswijt, Mittelmann, and 
Schrijver have carried out the analogue for binary error-correcting codes |GMS10| . 
which suggests that the continuous version may also be tractable. 

If we had further examples of sharp three-point bounds, they might suggest 
organizing principles that could lead to a deeper understanding. For example, for 
two-point bounds in two-point homogeneous spaces, Levenshtein |L92j proved a 
beautiful criterion for being an optimal code in terms of strength as a design: any 
m-distance set that is a (2m— l)-design (or even an antipodal (2to — 2)-design) is an 
optimal code. In these cases, the two-point bounds are sharp and can be understood 
conceptually, with no need for numerical calculations. The same criterion applies 
more generally to prove universal optimality (Theorems 1.2 and 8.2 in |CK07| V 
Every known case in which two-point bounds are sharp fits into this framework, 
with one exception, namely the regular 600-cell |A99[ ICK07) . 

We cannot even propose a conjectural generalization of Levenshtein's criterion to 
three-point bounds. However, we hope that some general principle will explain the 
sharp cases, offer guidance for how to locate more of them, and lead to proofs that 
involve no explicit numerical computations. 

Another question is whether there are good applications of three-point bounds to 
potential functions that depend on triples of points, rather than pairs. Of course, 
there is no obstacle to writing down such bounds, but it is not clear that there are 
any exciting examples. Pair potentials are far more common in physics, and we do 
not know which higher-order generalizations may be worthy of investigation. 



5.2. Projective spaces. Theorem ] 1.2 
in MP^, but as we will see in Subsection 



set tles the question of universal optimality 
5.3 there may be cases in which three-point 
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bounds are sharp but universal optimality does not hold. We are unaware of any 
cases in projective space besides the rhombic dodecahedron code in which three-point 
bounds are sharp and two-point bounds are not, but it is difficult to imagine that it 
is the only example. 

It is especially intriguing that the three-point bounds prove universal optimality 
for this code, and it would be fascinating to find other such cases. It is unlikely 
that there are any in MP^. In addition to the universal optima that occur in every 
dimension (up to n orthogonal lines in M" , or n + 1 connecting the vertices of a 
regular simplex to its centroid), there are at least two other universally optimal line 
configurations in M'*: 12 lines through opposite vertices of a regular 24-cell and 60 
from a regular 600-cell. In each of these cases, universal optimality follows from 
two-point bounds: the last one can be proved using the techniques from Section 7 
of |CK07| ■ and all the others follow from Theorem 8.2 in [CK07) . We know of no 
other candidates for universal optimality in KP^. 

Five-dimensional line configurations are more promising. In addition to the 
generic cases of up to six lines, two-point bounds prove universal optimality for a 
ten-line configuration from (CHS96| (by Theorem 8.2 in |CK07| ). In the subspace 
of consisting of all points with coordinate sum zero, this configuration consists 
of the lines through all the permutations of (1,1,1,— 1,-1,-^1). If we include 
also the lines through the permutations of (5, —1, —1, —1, —1, —1), we get a 16-line 
configuration also studied in |CHS96| . The orthoplex bound is sharp for it, so it is 
an optimal projective code, but nothing more is known about energy minimization 
or uniqueness. 

Conjecture 5.1. The 16-line configuration in described above is universally 
optimal and is the unique optimal 16-point code in MP*. 

The parallels between this code and the rhombic dodecahedron code are notewor- 
thy. The seven lines in M.^ can be constructed completely analogously, using the 
permutations of the points (1,1,-1,-1) and (3, — 1, — 1, — 1). Equivalently, both 
codes can be constructed by starting with the lines through the vertices of a regular 
simplex centered at the origin, and then filling in all the holes (i.e., the lines at 
maximal distance from the original lines). 

Both codes can be proved optimal using the orthoplex bound. Nevertheless, 
the three-point bounds for potential energy do not seem to settle Conjecture |5.1t 
although they suffice to prove Theorem |1.1[ Perhaps four-point bounds can prove 
Conjecture |5.1[ but the calculations required to investigate this could be formidable. 
The pattern certainly does not continue to M7: the analogous configuration has 
(4)72 + 8 = 43 lines, but comparison with Table 4 from |CHS96j shows that it is 
not even an optimal projective code. 

5.3. Spheres. Although in this paper we focus on real projective space, we have 
also applied three-point bounds to potential energy minimization on spheres. We 
have not found any sharp cases beyond the two identified by Bachoc and Vallentin 
[BV09b| ■ namely the Petersen code in (the ten edge midpoints of a regular 
simplex) and the square antiprism in R^. Using three-point bounds, they proved 
rigorously that the Petersen code is an optimal code, and based on their calculations 
they conjectured that the bounds are also sharp for the antiprism. 

For ten points in S^, we observe a remarkable phenomenon in the three-point 
bounds for energy. The Petersen code is not universally optimal, because it sometimes 
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has greater potential energy than the code consisting of two regular pentagons on 
orthogonal planes in M^. However, the three-point bounds appear to be sharp in 
every case: 

Conjecture 5.2. For every completely monotonic potential function f on (0,4], 
either the Petersen code or the orthogonal pentagon code minimizes the energy Ef 
among all ten-point codes in . Furthermore, the three-point bounds are always 
sharp (for whichever code is optimal). 

In other words, the three-point bounds remain sharp throughout the phase 
transition between the two ground states. If true, this represents an unprecedented 
phenomenon in coding and energy minimization. 

We believe that Conjecture |5.2| should be provable, although we have not been 
able to complete a proof. The cone of completely monotonic functions on (0, 4] is 
spanned by the functions /(r) = (4 — r)*^ (see Theorem 9b in |W411 p. 154]). For 
fc < 2, both codes have the same energy and the two-point bounds are sharp. For 
3 < < 6, the three-point bounds are sharp for the orthogonal pentagons, and for 
k > 7 we can prove that they are sharp for the Petersen code (by using the spherical 

it 



analogue of Corollary 4.3 to reduce to a finite basis). To prove Conjecture 



5.2 



would suffice to prove it just for the functions /(r) = (4 — ry -\- Q!j^fc(4 — r) with 
3 < i < 6 and k > 7, where Uj^k > is chosen to make the two codes have equal 
energy. Unfortunately, we have found it much more difficult to deal with these cases. 

The same phenomenon as in Conjecture |5.2| seems to occur also for antiprisms. 
In that case, there is a one-parameter family of configurations. Each consists of two 
squares in parallel planes and offset by a 45° angle, but the distance between the 
planes can vary. 

Conjecture 5.3. For every completely monotonic potential function f on (0,4], 
some square antiprism minimizes the energy Ef among all eight-point codes in 
. Furthermore, the three-point bounds are always sharp (for whichever code is 
optimal). 

Because this conjecture involves a continuous family of optima, it may be more 
difficult to prove than Conjecture |5.2[ It seems to require a new idea beyond what 
suffices for Theorem 1 1.1 [ and our partial progress on Conjecture |5.2| The dependence 
of the optimal energy on the potential function is also extraordinarily complicated, 
because of the need to optimize over all square antiprisms. For example, for the 
Coulomb potential, the minimal energy within the family of antiprisms is a root of 
an irreducible polynomial of degree 48. 

These conjectures suggest a broad generalization of the mechanism behind the 
known proofs of universal optimality: 

Conjecture 5.4. Suppose X is a sphere or projective space, C is a finite subset of 
X, and f is a completely monotonic function other than a polynomial. If k -point 
bounds prove that C optimizes energy under f (or prove that C is an optimal code), 
then for every completely monotonic potential function, k-point bounds will prove a 
sharp bound for energy for \C\ points in X, although C may not itself be optimal. 

For low-degree polynomials, there are numerous counterexamples: for example, for 
each spherical fc-design, two-point bounds prove that it is optimal for /(r) = (4 — r)*^. 
However, that is the only loophole we have found. Conjecture |5.4| is very strong, 
and perhaps we have been misled by the few known examples with k > 2. It 
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may hold in these examples merely because the optimal structures are in some 
sense of comparable complexity. However, Conjecture |5.4| seems to be the simplest 
explanation of the available evidence. 

The most dramatic test case will be 24 points in S^. The vertices of a regular 24- 
cell are almost certainly the unique optimal code, but they are not universally optimal 
[CCEKOT] . Instead, there is a one-parameter family of competing configurations that 
improve on the 24-cell for some potential functions. Numerical evidence suggests 
that either the 24-cell or one of these competitors is always optimal. If that is true, 
and Conjecture |5.4| is true as well, then fc-point bounds cannot settle the optimality 
of the 24-cell without also dealing with its more exotic competitors. If they can 
accomplish both with a small value of fc, it will be truly remarkable. 
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Appendix A. The orthoplex bound 

In this appendix, we explain the orthoplex bound from |CHS96| in somewhat 
different terms than those used there (although it is essentially the same approach), 
and we discuss some examples and generalizations. 

The orthoplex bound applies to codes in any Grassmannian, but for real projective 
space it is particularly simple. It is based on the map from M" to its symmetric 
square Sym^ M" that takes x to x(>^x. Under the natural inner product on Sym^ M", 
{x ^ x,y ^ y) — {x, y)^. Let ei, . . . , e„ be an orthonormal basis of K", and let 
E = Si '8 6i. Then for all x € M", \x\'^ — {x® x, E). Thus, the unit sphere 

5""^ is mapped into the cross section of the unit sphere in Sym^M" that has inner 
product 1 with the vector E (note that \E\'^ = n). 

For X G iS'"""'^, define (p{x) = (xiSix — E/n)/ y^l — 1/n. (In other words, start with 
x®x, project orthogonally to E, and rescale to get a unit vector.) Then |(/3(a;)p = 1 
and {^{x), (p{y)) = {{x, y^ — l/n)/(l — 1/n), which implies that ^(x) — (p{y) if and 
only if a; = ±y. The image of tp is orthogonal to E and is thus contained in the unit 
sphere of a Euclidean space of n{n -I- 1)/2 — 1 dimensions. 

Under ip, an (n, N, t) antipodal code is mapped to an 

{n{n + l)/2 - l,N/2, [t^ - l/n)/(l - 1/n)) 

spherical code. Thus, any bound on such spherical codes yields a bound on the 
original antipodal code. Linear programming bounds are useless in this context, 
because any positive-definite function on the higher-dimensional sphere pulls back 
under tp to a. positive-definite function on S*""^ (so there is no need to apply ip in 
the first place). However, other bounds may prove fruitful. For example, for m + 1 < 
k < 2m, no /c-point code in S™'~^ can achieve an angle of better than 7r/2 (see |R55| 
or Theorem 6.2.1 in [B04] ). It follows that when n{n -f l)/2 < A^/2 < n{n -I- 1) - 2, 
no A'^-point antipodal code in S"'~^ can have maximal inner product less than Xj \/n, 
since otherwise its image under would have maximal inner product less than 0. 
This is the orthoplex bound for projective codes. 

It is not obvious that the orthoplex bound is ever sharp, but in fact it is sharp in 
several important cases, for example when 

(A/2, n) g {(7, 3), (11, 4), (12, 4), (16, 5), (22, 6), (46, 9), (47, 9), (48, 9)}. 
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These cases are all taken from [CHS96| . The (7, 3) case is of course the optimality 
as a projective code of the rhombic dodecahedron. Another noteworthy case is 
N/2 = n{n + 2)/2 with n a power of 4 (which generalizes the (12,4) case), due to 
Levenshtein |L82| . In this case, the orthoplex bound shows that optimality still 
holds when up to n/2 — 1 lines are omitted. 

Unfortunately, the orthoplex bound does not imply uniqueness. One problem is 
that when N/2 < n{n + 1) — 2, there are continuous families of spherical codes with 
maximal inner product 0. To prove uniqueness, one would have to determine which 
such codes are in the image of ip, which seems to be a subtle question. Even when 
N/2 — n{n + 1) — 2, the image of f is uniquely determined up to isometrics of the 
sphere containing it, but it docs not obviously follow that the original code is also 
unique. 

Uniqueness is not merely difficult to prove, but in fact sometimes false. For 
example, the unique (4,24, 1/2) antipodal code is the regular 24-cell (uniqueness 
follows easily from the two-point bounds). As one can see in the list of examples 
above, the orthoplex bound proves not only that it is optimal, but also that removing 
one pair of antipodal points still yields an optimal antipodal code of size 22. However, 
that is not the only (4, 22, 1/2) antipodal code. Another one is given by the vertices 
(±1/V3,±1/V3, ±1/^3,0) of a cube, a pair (0, 0, 0, ±1) of points antipodal to all 
of them, and the two octahedra 

(±73/2, 0,0, ±1/2), (0,±^/2,0,±l/2), (0, 0, ±^/2, ±1/2) 

in the 3-dimensional affine subspaces with fourth coordinate 1/2 or —1/2. This 
example, which appears to be new, is a (4,22,1/2) antipodal code that is not 
contained in the 24-cell, because inner products of ±1/3 and ±1/4 occur. We do not 
know of any other (4, 22, 1/2) antipodal codes, but we have not searched thoroughly 
enough to be confident that no others exist. 

Note that the second Gegenbauer polynomial P2 for S*"^^ is given by (t) — 
(t^ — l/n)/(l — l/n). Thus, the map (p satisfies {ip{x), (p{y)) = P2{{x,y)). This is 
not a coincidence, and in fact the orthoplex bound can be generalized to substitute 
any positive-definite function / for P^, if it is normalized to satisfy /(I) = 1. For 
individual Gegenbauer polynomials this follows immediately from the reproducing 
kernel property (see Subsection 2.2 of [CK07' for a review of the theory), and 
nonnegative linear combinations can be obtained by taking direct sums of the 
corresponding spaces. Unfortunately, we have been unable to find any cases other 
than / — P2 that lead to sharp bounds on antipodal or general spherical codes. 



Appendix B. Universal optima in 



The proof of completeness for the list in Theorem 1.2 is based on a theorem of 
Leech |L57| . He classified balanced configurations in S"^, i.e., point configurations 
that are in equilibrium for every pairwise force law depending only on distance. 
Equivalently, in the space of configurations they are stationary points for energy 
for every potential function. This is a necessary condition for universal optimality 
in S^, and for universal optimality in MP^ it is necessary that the corresponding 
antipodal configuration in S'^ be balanced. (If it is not balanced, then the net forces 
inherit the antipodal symmetry and descend to projective space.) 

One obvious balanced configuration is a ring of equally spaced points around 
the equator, and it remains balanced if one includes the north and south poles as 
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PolyliGdron 


Combination 


N 


Cosine of mininial angle 


Record 


Octahedron 


E 


6 


1/2 


0.447... 


Octahedron 


VE 


9 


1/72 = 0.707... 


0.669... 


Octahedron 


bjb 


10 


\/2/6 = U.olo . . . 


0.685 . . . 


Octahedron 


VEF 


13 


^2/3 = 0.816 .. . 


0.768 . . . 


Icosahedron 


E 


15 


(l + V^)/4 = 0.809... 


0.786 . . . 


Icosahedron 


F 


10 


v^/3 = 0.745 . . . 


0.686... 


Icosahedron 


VE 


21 


y(5 + y5)/10 = 0.850... 


0.846 . . . 


Icosahedron 


VF 


16 


y^(5 + 2^/5)/15 = 0.794... 


same 


Icosahedron 


EE 


25 


y^(3 + V5)/6 = 0.934... 


0.872... 


Icosahedron 


VEF 


31 


a/(3 + \/5)/6== 0.934... 


0.894... 



Table 1. Balanced configurations that must be ehminated. 



well. In addition, each regular polyhedron gives several balanced configurations. 
Its vertices, face centers, and edge midpoints each form a balanced configuration, 
and every combination of them (vertices and faces, vertices and edges, all three, 
etc.) is also balanced when they are rescaled to lie on the same sphere. Leech 
proved that this list of balanced configurations in is complete. By contrast, the 
high-dimensional case is not so simple, and there exist balanced configurations with 
no symmetry at all [CEKSIQ] . 

To classify the universal optima in EP^, one need only examine the antipodal 
balanced configurations in 5*^. The cases with a ring around the equator are easily 
dealt with: whenever the ring has more than four points in it and the potential 
function is sufficiently steep, one can lower the energy by moving one point slightly 
off the equator (and of course moving the antipodal point correspondingly) . The only 
difficulty is dealing with the finitely many cases that come from regular polyhedra. 

Table [l] shows a list of the cases that must be eliminated. It omits the actual 
universal optima from Theorem 1.2 as well as any duplicate cases (for example, 
from duality). The first four columns specify the polyhedron, which combination of 
vertices, edges, and faces is being used, the number N of lines, and the cosine of 
the minimal angle between them. The final column tells the smallest such cosine 
known for a configuration of this size, from Table 1 in |CHS96| . In every case except 
A'' = 16, the record from [CHS96| is better than the balanced configuration, which 
is therefore not universally optimal since it is not even an optimal projective code. 
However, the balanced configuration with = 16 is probably an optimal code. 

Nevertheless, the iV = 16 case is not universally optimal. It is a 2-design in 
KP^ but not a 3-design. However, a 16-point 3-design exists: it is equivalent to 
an antipodal 32-point spherical 7-design in S"^, and such a design is constructed 
in Section 4 of pS96j . It follows that the 16-line configuration is not universally 
optimal, and hence the list in Theorem |1.2| is complete. 
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Appendix C. Universal optimality of the regular 600-cell 

Theorem C.l (Cohn and Kumar |CK07j ). The 120 vertices of the regular 600-cell 
are universally optimal in . 

Sketch of proof. Section 7 of |CK07| describes a construction that applies two-point 
bounds for energy. It gives a sharp bound for every absolutely monotonic potential 
function, and in any given case it is easy to check that it works, but the proof that it 



works for every potential function is fairly elaborate. We will use Corollary 4.3 (or, 
technically, its analogue for spheres) to reduce the problem to a finite calculation. 

There are eight inner products between distinct points in the 600-cell, namely 
-1, 0, ±1/2, and (±1 ± V5)/4:. CaU them ii,. . . ,i8 with -I = ti < t2 < ■ ■ ■ < ts- 
Let T be the multiset {^i, ^2; ^3, ^3, • ■ ■ 7 ^87 ^s} (note that mult7-(ti) = 1). Now 
Corollary |4.3| reduces the problem to a finite number of cases. In each case but the 
last, it is not hard to check that the approach in Section 7 of |CK07| proves a sharp 
bound, but it fails in the last case, namely 

f{t) = it-ti)it-t2)^...{t-tr)\t-ts). 

The failure does not come as a surprise, because of course this function is not 
absolutely monotonic. Nevertheless, the approach can be salvaged in this case. In 
the language used at the top of page 127 in |CK07| . one should choose an auxiliary 
polynomial of degree 18 that agrees with / to order 2 at —1 (in addition to the 
other conditions listed in [CK07|). It is then straightforward to check that this 
polynomial proves a sharp bound for Ef. □ 

This proof is still not exactly conceptual, because it relies on a fair amount of 
computation, but it requires less computation and is substantially simpler than the 



proof in [CK07j . The same approach of using the spherical analogue of Corollary 4.3 
works for all the universal optima studied in |CK07| : in fact, it is in a sense equivalent 
to the approach used there, but this method was not made explicit in [ CK07] and 
was not applied to the 600-cell. 

Appendix D. Computer calculations 

Computer algebra files for checking the proof of Theorem |1.1| can be downloaded 
as part of the source files for this paper in the arXiv . org e-print archive. In addition 
to the I^T[h;X source for the paper, downloading the source files produces eight plain 
text files: definitions.txt, optimal.txt, unique.txt, and datal.txt through 
data5.txt. These files contain code for the PARI/GP computer algebra system 
[PARIj ■ which is freely available. The code is not especially elaborate and should 
be straightforward to adapt to other systems. 

Each of the data files contains the exact values of c, Fq, . . . , Fc,, and M for one 
of the five potential functions considered in the proof of Theorem |1.1| Specifically, 
it defines a constant c and a list L of six matrices, namely Fq, . . . , F5 and M. Note 
that in PARI format, [a,b;c,d] denotes the matrix 

a b 
c d 

In the proof of the theorem, we did not specify the order of the monomials u^vH^ 
in the vector z(u,v,t), or equivalently the order of the rows and columns of M; in 
the data files, we order them lexicographically by their exponent vectors (i, j, k). 
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The file definitions . txt defines the uhraspherical polynomials and the matrices 
SJ^ and TJ}. These definitions are read by the remaining two files. 

The file optimal.txt completes the proof of universal optimality. Specifically, it 
checks that all the matrices are symmetric and positive semidefinite, it checks the 
sum of squares decomposition 

H{u, V, t) = {z{u, V, t), Mz{u, V, t)), 

and it checks that the bound is sharp. To show that a d x d matrix is positive semi- 
definite, it computes the characteristic polynomial p{x) and verifies that {—l)'^p{—x) 
has no negative coefficients. All the calculations are done using exact rational 
arithmetic, so the resulting proof is rigorous. 

Finally, the file unique . txt completes the proof of uniqueness and thus of 
Theorem 1 It finds all {u,v,t) such that 

- H(u, V, t), 

and it solves a system of simultaneous equations to determine how many times 
such a triple must occur in an optimal configuration (i.e., to compute the numbers 
iVi, . . . , iVs). It is convenient to use the number l/-\/3, but we must avoid floating 
point arithmetic to keep round-off error from becoming a problem. To do so, we use 
polynomials in a variable s and work modulo — 1/3. As above, all the calculations 
are then carried out exactly. 



References 

[A99] N. N. Andreev, A spherical code, Russian Math. Surveys 54 (1999), no. 1, 251-253. 

[AAR99] G. Andrews, R. Askey, and R. Roy, Special Functions, Encyclopedia Math. Appl. 71, 

Cambridge University Press, Cambridge, 1999. 
[B06] C. Bachoc, Linear programming bounds for codes in Grassmannian spaces, IEEE Trans. 

Inform. Theory 52 (2006), 2111-2125, arXiv:math. CO/0610812. 
[BV08] C. Bachoc and F. Vallentin, New upper bounds for kissing numbers from semidefinite 

programming, J. Amer. Math. Soc. 21 (2008), 909-924, arXiv: math/0608426. 
[BV09a] C. Bachoc and F. Vallentin, Semidefinite programming, multivariate orthogonal 

polynomials, and codes in spherical caps, Europ. J. Comb. 30 (2009), 625—637, 

arXiv:math/0610856. 

[BV09b] C. Bachoc and F. Vallentin, Optimality and uniqueness of the (4,10,1/6) spherical 

code, J. Comb. Theory Ser. A 116 (2009), 195-204, arXiv: 0708. 3947. 
[B+09] B. Ballinger, G. Blekherman, H. Cohn, N. Giansiracusa, E. Kelly, and A. Schiirmann, 

Experimental study of energy-minimizing point configurations on spheres, Experiment. 

Math. 18 (2009), 257-283, arXiv:math.MG/0611451. 
[B41] S. Bochner, Hilbert distances and positive definite functions , Ann. of Math. 42 (1941), 

647-656. 

[B99] B. Borchers, CSDP, a C library for semidefinite programming , Optim. Methods Softw. 

11 (1999), 613-623. 

[B04] K. Boroczky, Jr., Finite packing and covering, Cambridge Tracts in Mathematics 154, 

Cambridge University Press, Cambridge, 2004. 
[BG09] M. Bowick and L. Giomi, Two-dimensional matter: order, curvature and defects. 

Advances in Physics 58 (2009), 449-563, arXiv : 0812 . 3064. 
[CIO] H. Cohn, Order and disorder in energy minimization. Proceedings of the International 

Congress of Mathematicians, Hyderabad, August 19-27, 2010, Volume IV, pages 

2416-2443, Hindustan Book Agency, New Delhi, 2010, arXiv: 1003.3053. 
[CCEK07] H. Cohn, J. H. Conway, N. D. Elkies, and A. Kumar, The D4 root system is not 

universally optimal. Experiment. Math. 16 (2007), 313-320, arXiv:math.MG/0607447. 



THREE-POINT BOUNDS FOR ENERGY MINIMIZATION 



29 



[CEKSIO] H. Cohn, N. D. Elkies, A. Kumar, and A. Schiirmann, Point configurations that 
are asymmetric yet balanced, Proc. Amer. Math. Soc. 138 (2010), 2863-2872, 
arXiv: 0812. 2579. 

[CK07] H. Cohn and A. Kumar, Universally optimal distribution of points on spheres, J. Amer. 

Math. Soc. 20 (2007), 99-148, arXiv:math.MG/0607446. 
[CHS96] J. H. Conway, R. H. Hardin, and N. J. A. Sloane, Packing lines, planes, etc.: packings 

in Grassmannian spaces. Experiment. Math. 5 (1996), 139-159. 
[CS99] J. H. Conway and N. J. A. Sloane, Sphere Packings, Lattices and Groups, third edition, 

Grundlehren Math. Wiss. 290, Springer- Verlag, New York, 1999. 
[D72] P. Delsarte, Bounds Jor unrestricted codes, by linear programming , Philips Res. Reports 

27 (1972), 272-289. 

[GMSIO] D. C. Gijswijt, H. D. Mittelmann, and A. Schrijver, Semidefinite code bounds based on 

quadruple distances, preprint, 2010, arXiv: 1005.4959. 
[HS96] R. H. Hardin and N. J. A. Sloane, McLaren's improved snub cube and other new 

spherical designs in three dimensions. Discrete Comput. Geom. 15 (1996), 429-441. 
[L85] S. Lang, SL2(K), Graduate Texts in Mathematics 105, Springer- Verlag, New York, 

1985. 

[L57] J. Leech, Equilibrium of sets of particles on a sphere. Math. Gaz. 41 (1957), 81-90. 

[L82] V. I. Levenshtein, Bounds for the maximal cardinality of a code with bounded modulus 

of the inner product (Russian), Dokl. Akad. Nauk SSSR 263 (1982), 1303-1308; 

translation in Soviet Math. Doklady 25 (1982), 526-531. 
[L92] V. I. Levenshtein, Designs as maximum codes in polynomial metric spaces. Acta Appl. 

Math. 29 (1992), 1-82. 

[M08] M. Marshall, Positive polynomials and sums of squares. Math. Surveys Monogr. 146, 

American Mathematical Society, Providence, RI, 2008. 
[M07] O. Musin, Multivariate positive definite functions on spheres, preprint, 2007, 

arXiv:math/0701083. 

[N-l-08] M. Nakata, B. J. Braams, K. Fujisawa, M. Fukuda, J. K. Percus, M. Yamashita, and 
Z. Zhao, Variational calculation of second-order reduced density matrices by strong 
N -representability conditions and an accurate semidefinite programming solver, J. 

Chem. Phys. 128 (2008), 164113:1-14. 

[PARI] PARI/GP, version 2.3.4, Bordeaux, 2008, jhttp : //pari .math. u- b ordeaux . f r/' 
[P44] A. L. Patterson, Ambiguities in the X-ray analysis of crystal structures, Phys. Rev. 65 

(1944), 195-201. 

[P07] F. Pfender, Lmproved Delsarte bounds for spherical codes in small dimensions, J. 

Combin. Theory Ser. A 114 (2007), 1133-1147, arXiv :math/0501493. 
[P93] M. Putinar, Positive polynomials on compact semi-algebraic sets, Indiana Univ. Math. 

J. 42 (1993), 969-984. 

[R55] R. A. Rankin, The closest packing of spherical caps in n dimensions, Proc. Glasgow 

Math. Assoc. 2 (1955), 139-144. 
[S42] I. J. Schoenberg, Positive definite functions on spheres, Duke Math. J. 9 (1942), 96-108. 

[S05] A. Schrijver, New code upper bounds from the Terwilliger algebra and semidefinite 

programming, IEEE Trans. Inform. Theory 51 (2005), 2859-2866. 
[SW51] K. Schiitte and B. L. van der Waerden, Auf welcher Kugel haben 5, 6, 7, 8 oder 9 

Punkte mit Mindestabstand Bins Platz? , Math. Ann. 123 (1951), 96-124. 
[W41] D. V. Widder, The Laplace Transform, Princeton Math. Ser. 6, Princeton University 

Press, Princeton, NJ, 1941. 
[Y92] V. A. Yudin, Minimum potential energy of a point system of charges (Russian), Diskret. 

Mat. 4 (1992), 115-121; translation in Discrete Math. Appl. 3 (1993), 75-81. 

Microsoft Research New England. One Memorial Drive, Cambridge, MA 02142 
E-mail address: cohnOmicrosof t . com 

Department of Mathematics, Harvard University, Cambridge, MA 02138 
E-mail address: wooSmath. harvard. edu 



